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Abstract. The inverse conjecture for the Gowers norms U'^{V) for finite-dimensional 
vector spaces V over a finite field F asserts, roughly speaking, that a bounded function 
/ has large Gowers norm ||/||c/<i(v) if Eind only if it correlates with a phase polynomial 
(j) = ep{P) of degree at most d — 1, thus P : V ^ ¥ is a. polynomial of degree at most 
d—1. In this paper, we develop a variant of the Furstenberg correspondence principle 
which allows us to establish this conjecture in the large characteristic case char(F) ^ d 
from an ergodic theory counterpart, which was recently established by Bergelson and 
the authors in [2]. In low characteristic we obtain a partial result, in which the phase 
polynomial (j) is allowed to be of some larger degree C{d). The full inverse conjecture 
remains open in low characteristic; the counterexamples in |13] . |15] in this setting can 
be avoided by a slight reformulation of the conjecture. 



1. Introduction 

1.1. The combinatorial inverse conjecture in finite characteristic. Let F be 

a finite field of prime order. Throughout this paper, F will be considered fixed (e.g. 
F = F2 or F = F3), and the term "vector space" will be shorthand for "vector space 
over F", and more generally any linear algebra term (e.g. span, independence, basis, 
subspace, linear transformation, etc.) will be understood to be over the field F. 

If is a vector space, / : ^ C is a function, and h & V is a shift, we define the 
(multiplicative) derivative A^f : ^ C of / by the formula 

A,/ := (T,/)7 

where the shift operator with shift h is defined by Thf{x) := f{x + h). An important 
special case arises when / takes the form / = e^l^P), where P : V — > F is a function, 
and cf : F ^ C is the standard character ef{j) := e^'^*-'/'"'' for j = 0, . . . , |F| — 1. In that 
case we see that Ahf = ef{AhP), where A^P : V ^ ¥ is the (additive) derivative of P, 
defined as 

AhP = ThP - P 

Given an integer ^ 0, we say that a function P : V ^ ¥ is a, polynomial of degree 
at most d if we have Ah^ . . . A/i^^^P = for all /ii, . . . , hd+i G V , and write Voiy^iy) 
for the set of all polynomials on V of degree at most d\ thus for instance Polyg(V") is 
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the set of constants, Poly]^(y) is the set of hnear polynomials on V, Poly2(^) is the 
set of quadratic polynomials, and so forth. It is easy to see that Poly^(V^) is a vector 
space, and if V = = {{xi, . . . , x„) : Xi, . . . ,Xn € F} is the standard n-dimensional 
vector space, then Poly^(K) has the monomials x^^ . . . x^" for ^ ii, . . . ,in < |F| and 
ii + . . . + in ^ d as a basi£|. 

We shall say that a function / : — > C is a phase polynomial of degree at most d if 
all {d + ly^ multiplicative derivatives A^-^ . . . A^^^^f are identically 1, and write Vd{V) 
for the space of all phase polynomials of degree at most d. We have the following 
equivalence between polynomials and phase polynomials in the high characteristic case: 

Lemma 1.2 (Phase polynomials are exponentials of polynomials). Suppose that ^ 
d < char(F), and f : V ^ C Then the following are equivalent: 

(i) feV,{V). 

(ii) / = e^'^'^CfiP) for some 6 G R/Z and P G Poly^(K). 

Proof. See P, Lemma D.5]. □ 

Remark 1.3. The lemma fails in the low characteristic case d ^ char(F); consider for 
instance the function / : F2 ^ C defined by /(I) := i and /(O) := 1. This function lies 
in P2(F2) but does not arise from a polynomial in Poly2(F2). 

Definition 1.4 (Expectation notation). If A is a finite non-empty set and f : A ^ C 
is a function, we write \A\ for the cardinality of A, and E^i/, /, or Ea.g^/(x) for the 
average ^Exga/(2^)- 

Definition 1.5 (Gowers uniformity norm). [B], [7] Let \^ be a finite vector space, let 
/ : — * C be a function, and let d ^ 1 be an integer. We then define the Gowers norm 
\\f\\u''{v) of / to be the quantity 

||/||^.(v^) := |E,,...,,, / A,, . . . A,jf/'\ 
Jv 

thus ||/||(7d+i(y) measures the average bias in d^^ multiplicative derivatives of /. We also 
define the weak Gowers norm \\f\\ud(v) of / to be the quantity 

||/||nd(y) := sup I / (1.1) 

</.ePd_i(V) Jv 

thus ||/||u£i(v') measures the extent to which / can correlate with a phase polynomial of 
degree at most d — 1. 

Remark 1.6. It can in fact be shown that the Gowers and weak Gowers norm are in 
fact norms for d ^ 2 (and seminorms for d = 1), see e.g. [7] or [2Tj. Further discussion 
of these two norms can be found in [lOj. In view of Lemma [T72| in the high characteristic 
case char(F) ^ d one can replace the phase polynomial G Vd-iiV) in (11.11) by the 
exponential ew{P) of a polynomial P G Poly^_i(V^). However, this is not the case in low 

""^The restriction ii, . . . , i„ < |F| arises of course from the identity x'*"' — x for all x e F. 
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characteristic. For instance, let ¥ = ¥2, V = Fg, and consider the symmetric function 
S4 : V ^ ¥2 defined by 

84(^X1, Xn) '■= ^ ^ XiXjXkXi- 

l^i<j<k<l^n 

Then the function / := (— l)"^" has low correlation with any exponential eF(P) = (— 1)^ 
of a cubic polynomial P G Poly3(l^) in the sense that Ea;gy/eF(— -P) = o„^oo(l) (see 
[T5] . [T3]): on the other hand, it is not hard to verify that the function 

g{x^,...,Xn) :=e2-^N/^ 

where denotes the number of indices 1 ^ j ^ n for which Xj = 1, lies in Vs{V) and 
has a large inner product with /; indeed, since f{x) = +1 when \x\ = 0, 1,2,3 mod 8 
and —1 otherwise, we easily check that 

X . —27vi —4:711 — Sttz — Stt^ — IOtt^ — 127rt — lAiri . , . 

Exey/^=o(l + ^ * ' ' ' ) + o„^oo(l) 

o 

= ^ h O„_oo(l)- 

In particular, we see that ||(— 1)'^*||m4(i/) is bounded from below by a positive absolute 
constant for large n. 



Let V := {z E : \z\ ^ 1} be the compact unit disk. This paper is concerned with the 
following conjecture: 

Conjecture 1.7 (Inverse Conjecture for the Gowers norm). Let ¥ he a finite field and 
let d^ 1 be an integer. Then for every 5 > there exists e > such that ||/||ud(y) ^ e 
for every finite vector space V and every function f : V ^ V such that ||/||c/'*(y) ^ ^• 

Remark 1.8. This result is trivial foT d = 1, and follows easily from Plancherel's 
theorem for d = 2. The result was established for ci = 3 in [10] (for odd characteristic) 
and [17] (for even characteristic), and a formulation of Theorem 11.91 was then conjectured 
in both papers, in which the phase polynomials were constrained to be char(F)**^ roots 
of unity. This formulation of the conjecture turned out to fail in the low characteristic 
regime char(F) + 1 < d (see [13|, [IS]); however, the counterexamples given there do 
not rule out the conjecture as formulated above in this case, basically because of the 
discussion in Remark 11.61 

The case when S was sufficiently close to 1 (depending on d) was treated in [Ij , while the 
case when char(F) is large compared to d and 6 was established in [18]. In [T3|, Theorem 
11.91 was also established in the case when / was a phase polynomial of degree less than 
char(F). These results have applications to solving linear systems of equations (and in 
particular, in finding arithmetic progressions) in subsets of vector spaces [12], [8] and 
also to polynomiality testing [17], [3]. Conjecture 11.71 is also the finite field analogue of 
a corresponding inverse conjecture for the Gowers norm in cyclic groups Z/A^Z, which 
is of importance in solving linear systems of equations in sets of integers such as the 
primes; see [TT], [Ij for further discussion. 
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The main result of this paper is to estabhsh this conjecture in the high characteristic 
case. 

Theorem 1.9 (Inverse Conjecture for the Gowers norm in high characteristic). Con- 
jecture [777| holds whenever char(F) > d. 

In the low characteristic case we have a partial result: 

Theorem 1.10 (Partial inverse Conjecture for the Gowers norm). Let ¥ be a finite 
field and let d ^ 1 he an integer. Then for every 5 > there exists e > such that 
^ ^ for every finite vector space V and every function f : V ^ T) such that 
\\f\\u'i{v) ^ ^! where k = C{d) depends only on d. 

Remark 1.11. One could in principle make the quantity k = C{d) in Theorem 11.101 
explicit, but this would require analyzing the arguments in p] in careful detail. One 
should however be able to obtain reasonable values of k for small d (e.g. d = A). 

The proofs of Theorems II. 9[ 11.101 rely on four additional ingredients: 

• An ergodic inverse theorem for the Gowers norm for F'^-systems (Theorems 
[rmOO]) . established in [2]; 

• The Furstenberg correspondence principle^Sj, combined with the random aver- 
aging trick of Varnavides|22j: 

• A statistical sampling lemma (Proposition 13. 13( ): and 

• Local testability of phase polynomials (Lemma 14.51) . essentially established in 

m 

Of these ingredients, the ergodic inverse theorem is the most crucial, and we now pause 
to describe it in detail. 

1.12. The ergodic inverse conjecture in finite characteristic. Let F"^ := IJ^o^'^ 
be the inverse limit of the finite-dimensional vector spaces F", where each F" is included 
in the next space F""*"^ in the obvious manner; equivalently, F"^ is the space of sequences 
{xi)°^i with Xi G F, and all but finitely many of the Xi non-zero. This is a countably 
infinite vector space over F. 

Definition 1.13 (F'^-system) . A -system is a quadruplet X = {X, {Tg)g^F^), 
where (X, B, fi) is a probability space, and T : h Th is a measure-preserving action 
of F'^ on X, thus for each h G F"^, Th : X X is a. measure-preserving bijection such 
that T/j o Tfc = Th+k for all h,k G F'^. Given any measurable : X — > C and h E W^, 
we define Thc/) : X ^ C to be the function Thcj) := (j) oT^, and Ahcj) : X ^ C to be the 
function Ah(f) := Thcj) ■ (p. We also define the inner product {f,g) := fx fd 
f,g E L^(X), where the Lebesgue spaces L^(X) = Wi^X.B, jj) are defined in the usual 
manner. We say that the system is ergodic if the only F'^-invariant functions on i^^(X) 
are the constants. 

Definition 1.14 (Phase polynomial). Let X = {X,B, ii,{Tg)g^^^) be an F'^-system, 
and let ^ 0. We say that a function cf) G L°°(X) is a phase polynomial of degree at 
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most d if we have A^-^ . . . Ah^^^cj) = 1 /x-a.e. for all hi, ... , h^+i G F"^. We let 'Pd(X) 
denote the space of all phase polynomials. 

Remark 1.15. By setting hi = . . . = hd+i = we see that every phase polynomial 
e VdO^) has unit magnitude: |0| = 1 /x-a.e.. 

Definition 1.16 (Gowers-Host-Kra seminorms). [13] Let X = {X,B,fi, (Tg)ggirc^) be a 
F'^-system, and let G L°°(X). We define the Gowers-Host-Kra seminorms ||0||i7d(x) 
for d ^ 1 recursively as follows: 

• If = 1, then ||0||[/i(x) 

• If > 1, then ||0||(7d(x) 

We also define the weak Gowers-Host-Kra seminorm ||0||„d(x) as 

||0L''(X) := sup 

V'ePd-i(x) 

Example 1.17. If G Vd-i(X.) is a phase polynomial of degree at most d — 1, then 

Remark 1.18. One can use the ergodic theorem to show that the limits here in fact 
converge, but we will not need this. The If^ are indeed seminorms, but we will not need 
this either. 

In pi, Corollaries 1.26,1.27], the following ergodic theory analogues of Theorems II. 9[ 
11.101 was shown: 

Theorem 1.19 (Inverse Conjecture for the Gowers-Host-Kra seminorm for high char- 
acteristic). Let X = {X,B,fi, {Tg)g(zfu:) be an ergodic W^-system, let char(F) ^ d ^ 1, 
and let G L°°(X) be such that ||0||[/<i(x) > 0. Then ||0||ud(x) > 0. 

Theorem 1.20 (Partial Inverse Conjecture for the Gowers-Host-Kra seminorm for 
general characteristic). Let X = {X,B,fi, (Tg)g(z^u,) be an ergodic -system, let d^ 1, 
and let G L°°(X) be such that ||0||;7d(x) > 0. Then ||0||ufc(x) > for some k = C{d) 
depending only on d. 

Remark 1.21. The "if" part of this theorem follows easily from van der Corput's 
lemma; the important part of the theorem for us is the "only if" part. These results 
can be viewed as a finite field analogue of the results in [13] in high characteristic (and 
a partial analogue in the low characteristic case), and indeed draws heavily on the tools 
developed in that paper; see [2J for further discussion. It is quite possible that k can in 
fact be taken to equal d in Theorem 1 1 . 201 (or equivalently, that the condition char(F) ^ d 
can be dropped in Theorem 11.191) : this would imply Conjecture 11.71 in full generality. 

We will use Theorem 11.201 as a "black box" , and it will be the primary ingredient in our 
proof of Theorem 11.101 in much the same way that the Furstenberg recurrence theorem 
is the primary ingredient in Furstenberg's proof of Szemeredi's theorem in p]. Theorem 
11.191 plays a similar role for Theorem 11.91 



:= limsup„_^ (^\\Eh(.w"T, 
:= limsup„^„ 



1/2 
1/2'' 



^h(p\\1jd-^x,f^,T) 



6 



TERENCE TAG AND TAMAR ZIEGLER 



As with any other argument using a Furstenberg-type correspondence principle, our 
bounds are ineffective, in that we do not obtain an exphcit value of e in terms of d 
and 6. In principle, one could finitise the arguments in [2J (in the spirit of [TH]) to 
obtain such an explicit value, but this would be extremely tedious (and not entirely 
straightforward), and would lead to an extremely poor dependence (such as iterated 
tower-exponential or worse). We will not pursue this matter here. 

1.22. Acknowledgments. The first author is supported by a grant from the MacArthur 
Foundation, and by NSF grant CCF-0649473. The second author is supported by ISF 
grant 557/08, by a Landau fellowship - supported by the Taub foundations, and by 
an Alon fellowship. The authors are also greatly indebted to Ben Green for helpful 
conversations, and Vitaly Bergelson for encouragement. 

2. Notation 

We will rely heavily on asymptotic notation. Given any parameters xi,...,Xk, we 
use Ox^,...,Xk{X) to denote any quantity bounded in magnitude by Ca;i,...,a;j.X for some 
finite quantity Cxi,...,xfc depending only on xi, . . . ,Xk. We also write Y <tixi,...,xk X or 
X ^xx,...,Xk ^ for ^ = Ox^,...,Xk{X). Furthermore, given an asymptotic parameter n that 
can go to infinity, we use o„^oo;xi,...,Xfc(-^) to denote any quantity bounded in magnitude 
by Cx^^,,,^Xk{n)X, where c^i,...,xfc(^) is a quantity which goes to zero as — > oo for fixed 
Xi, . . . , Xfc. Thus for instance, if r2 > ri > 1, then = Oj.2^oo;ri 

3. Statistical sampling 

It is well known that the "global average" Kh^vfW of ^ bounded function f : V ^ V 
can be accurately estimated (with high probability) by randomly selecting a number 
of points Xi, . . . ,xn G V and computing the empirical Monte Carlo average (or "local 
average") Ki^n^Nf^Xn)- Indeed, it is not hard to show (by the second moment method) 
that with probability OAr_>oo(l), one has 

Ei^„^7v/(a;„) = Ehevfih) + On-.ooC^)- 

The point here is that the error term is uniform in the choice of / and V. 

We now record some variants of this standard "random local averages approximate 
global averages" fact, in which we perform more exotic empirical averages. We begin 
with averages along random subspaces of V. 

Lemma 3.1 (Random sampling for integrals). Let V be a finite- dimensional vector 
space, and let f : V ^ V be a function. Let Vi, . . . ,Vm & V be chosen independently at 
random. Then with probability 1 — Om^ooi^), we have 

Ese¥"-fid-v) =Ehevf{h) +o^^oo(l) 
where v := (wi, . . . , Vm), and a ■ v := aiVi + . . . + amVm- 
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Remark 3.2. One can easily make the Om->oo(l) terms more explicit, but we will not 
need to do so here. 

Proof. We use the second moment method. Note that 

(the Om-»oo(l) error arising from the a = contribution) so by Chebyshev's inequality 
it suffices to show that 

The left-hand side can be rearranged as 

It is easy to see that the inner expectation is |E/igy/(/i)p unless a = cb, for some c G F 
in which case it is 0(1). The claim follows. □ 

In the above lemma, / was deterministic and thus independent of the Vi. But we can 
easily extend the result to the case where / depends on a bounded number of the vf. 

Corollary 3.3 (Random sampling for integrals, II). Let V be a finite- dimensional vector 
space, let m ^ mo ^ 0, let vi, . . . ,Vm & V be chosen independently at random, and let 
fvi,...,vm(, '■ V V be a function that depends on Vi, . . . ,Vmo but is independent of 
Vmo+iy ■ ■ ■ yVm- Thcn with probability 1 — Om^oo;mo(l); we have 

Proof We write a = (ao,ai) e F*^" x F™"™" and v = {vo,Vi) e V""' x V""-""". If 
we condition vq = {vi, . . . ,fmo) to be fixed, we see from applying Lemma 13.11 to the 
remaining random vectors Vi that for fixed oq, we have 

lEsigF^-^o (a ■ v) = Kh^vfvi,...,Vr„Q{(^0 ■ Vq + h) + Om,-mo^oo(l) 

with probability 1 — Om-mo->oo{^) conditioning on vq; integrating this we see that the 
same is true without the conditioning. We can shift h hy Sq ■ Vq, move the h average 
onto the other side, and take expectations to conclude that 

for each Sq; averaging over oq by the triangle inequality we obtain the claim. □ 

Remark 3.4. It is with this corollary that we are implicitly exploiting the highly tran- 
sitive nature of the symmetry group GL{V) available to us. In the setting of the cyclic 
group Z/A^Z, the analogue of Lemma l3.ll is still true, namely that one can approxi- 
mate a global average /^^^^ / by a local average on random arithmetic progressions of 
medium length, but this approximation no longer holds if / is allowed to depend on 
the ffist few values of that progression, since this of course determines the rest of the 
progression; this is related to the fact that (for prime, say), the affine group of Z/A^Z 
(which is analogous to GL{V)) is 2-transitive but no stronger. In contrast, in the finite 
field setting, a small subspace of a medium-dimensional subspace does not determine 
the whole subspace. 
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We will need to generalise these results further by considering more exotic averages 
along cubes. A typical result we will need can be stated informally as 



Jv 

^^h^M&v / f{ThJ){ThJ)Th^+hJ 
Jv 



(3.1) 



when mi is large, m2 is large compared with mi, and v is random (see Lemma [3.91 for 
the formal version of this type of estimate). Such results follow (heuristically, at least), 
by iterating the previous results. For instance, from Corollary 13.31 we heuristically have 

lEa2eF'"2lEsiGF™i / fiTaiVif)iTg^-V2f)iTg^-vi+d2V2f)ix) 

Jv 

~ E/ijev-EaigF™! / fiTs^-vif)iTh2f)iT^i-vi+h2f) 
Jv 

when 7712 is large compared to mi and then interchanging the expectations and applying 
Lemma 13.11 heuristically yields 

EhaGyEgjgF™! / f{Tdi-vif){Th2f){Tdivi+h2f) 
Jv 

Jv 

when mi is large, thus giving (13. ip . 

We will formalise the precise statement along these lines that we need later in this 
section. We begin with some key definitions. 

Definition 3.5 (Lipschitz norm). If G : ^ C is a function on a polydisk P", we 

define the Lipschitz norm IIGHup of G to be the quantity 

\G{z)-G{w)\ 

IIGIlLip := sup |G(2;)| + sup ^ 

z&V^ z,weV":z^w a[z,w) 

where we use the metric 

d{{zi, . . .,Zn), {Wi, . . .,Wn)) := | ^1 - Wl | + . . . + \Zn - W„|. 

Definition 3.6 (Accurate sampling sequence). Let k ^ 1, let be a finite-dimensional 
vector space, let / : — P be a bounded function, and let 

= i/o < ^1 < ^2 < ^3 < • • • 

be a sequence of integers (or "scales"). We define an accurate sampling sequence for / 
of degree k and at scales ifi, if2, • • • to be an infinite sequence of vectors 

wi,t;2,t;3, ---eV 

such that for every sequence 

^ To < ri < . . . < Tfc 
of scales and every Lipschitz function G : "pi^'i}''^'^ C, we have 

II^IIl' 

|G'/,ro,ri,...,rfc - Gf,ro\ ^ (3.2) 
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where 
where 

u= {di-vi,...,dk-Vk); Vj = {vi, . . . , VHr^), j = 0,...,k, 

and 

Gf,ro{x) := Eh,<.v,...,h^GvG{{f{x + uj-h + b- t^o))^e|o,i}fe,feeF«'-o 
where h = {hi, ... , h^). 

Remark 3.7. The denominator ri in fl3.2p could be replaced by any other fixed function 
of Ti that went to infinity as ri — > oo if desired here. 

Remark 3.8. We make the trivial but useful remark that an accurate sampling sequence 
of degree k is also an accurate sampling sequence of degree k' for any 1 ^ k' ^ k. Indeed, 
to verify for a function G' : V^^'^^" ^° ^ V and some scales r^/ > . . . > rg ^ 0, 
one simply adds some dummy scales Vk'+i, • • • , Tfc above r^/ and extends G' to a function 

composing with the obvious restriction map from T)^^'^^ 

to p{0.1}'='xF^'-o_ 



Roughly speaking, an accurate sampling sequence will allow us to estimate all the global 
averages that we need for the combinatorial inverse conjecture for the Gowers norm by 
local averages which are suitable for lifting to the ergodic setting via the correspondence 
principle. We illustrate the use of such sequences by describing the three special cases 
of (13. 2p that we will actually need in our arguments. 

Lemma 3.9 (Global Gowers norm can be approximated by local Gowers norm). Let 
d ^ 1, let V be a finite- dimensional vector space, let f : V ^ V be a bounded function, 
and let vi,V2, . . . E V be an accurate sampling sequence for f of degree d and at scales 
Hi, H2, .... Then for every sequence of scales 

< ri < r2 < . . . < 

we have 

^a^€F"-i,...,dae¥"-d / ^ai-t^ri • • ■ ^a^-VrJ = WfWuHV) + Ori-.oo;d(l)- 

Remark 3.10. As with all other estimates in this section, the point is that the error 
term is uniform over all choices of / and V . Note that the d = 2 case of this lemma is 
a formalisation of (13. ip . 

Proof. We apply (13. 2p with Tq = 0, and G : J)^^'^^'' C being the function 

G ((^(^)).e{o,i}^) := n C-^''-^-^^^) 

where C : z t-^z is the complex conjugation operator. A routine computation gives the 
identities 

G f,0,ri,...,ra{^) = ^dieW^'^i ,...,da&"''d^ai-Vr-^ ■ ■ ■ ^a^-Vr^f 
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and 



2'i 



V 



Gffi - \\j Wv^ivy 



Also, it is easy to see that the Lipschitz norm ||G||Lip is 0^(1). The claim now follows 
immediately from fl3.2p and the triangle inequality. □ 

Lemma 3.11 (Global averages can be approximated by local averages). Let V he a 
finite- dimensional vector space, let f : V ^ V be a bounded function, and let vi,V2, ■■■ E 
V be an accurate sampling sequence for f of degree 1 and at scales Hi,H2, . . .. Then 
for every finite sequence hi, . . . ,bm G and every continuous function F : V"^ — >• C, 
we have 

/ \Es^^HrTs.^g - / ^1 = 0^^^^ g 
Jv Jv 

where g : V C is the function 

g{x) := F(rg^../(x), . . • , Tg^.,^/(x)). (3.3) 



Proof. By approximating the continuous function F uniformly by a Lipschitz func- 
tion, we may assume that F is Lipschitz. By adding dummy vectors to the collection 
bi, . . . ,bm if necessary, we may assume that {bi, . . . ,bm} = E^''° for some tq > de- 
pending on bi, . . . ,bm, thus F is now a Lipschitz function from "^'^ to C. 

Note that to prove the claim we may without loss of generality restrict to the regime 
r > ro. We now apply with G : piO'ilx^"""" ^ C being the function 

G ((^(^,^))^g{0,l},feGF^'-o) — ^ ((^(l,^))6GF«'-o) • 

A routine computation gives the identities 
and 

Gf,ro{x) = Eh(zvThg{x) = 9- 

Jv 

Also, it is clear that G is Lipschitz with norm OF,ro{^)- The claim then follows from 

(D. ' □ 

Lemma 3.12 (Global polynomiality test can be approximated by local polynomiality 
test). Let k ^ 1, let V be a finite- dimensional vector space, let f : V V be a bounded 
function, and let Vi, V2, ■ ■ ■ & V be an accurate sampling sequence for f of degree k and at 
scales Hi, H2, .... Then for every finite sequence bi, . . . ,bjn & and every continuous 
function F : V"^ — > C, we have 



lEhl,...,/ifeGV / l^hi • • • ^h^g - 1| + Ori-.0;F,m,6i,...,6„,fc(l) 
JV 



for any 1 ^ ri < r2 < ■ ■ ■ < Vk, where g : V ^ C is the function defined by (13. 3p . 
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Proof. Arguing as in Lemma I3.1H we may assume that {bi, . . . ,bm} = ¥ "^o for some 



tq > depending on 61, ... , bm, and that F : — * C is Lipschitz 



Note that to prove the claim we may without loss of generality restrict to the regime 



ri > ro. We now apply ([32]) with G : V'^^'^^ 



being the function 



G 



((^(^^5^))^e{0,l}fc,6GF«'-0 



we{o,i}fc 



[{z{uj,b))l^^Hr^ ) - 1 



where C is again the complex conjugation operator. A routine computation gives the 
identities 



G 



f,ro,ri,...,r^. 



^^aiSF^'-i • • • ^ake¥"-k \^i-v ■ ■ ■ ^^-vQix) - 1| 



and 



for any tq < ri < . . . < r^. Also it is clear that G is Lipschitz with norm OF^ro,k{^)- 
The claim then follows from (13. 2p and the triangle inequality. □ 



Of course, in order to utilise the above lemmas we need to know that such accurate 
sampling sequences in fact exist. This is the purpose of the following proposition. 

Proposition 3.13 (Existence of accurate sampling sequence). Let d ^ 1. Then there 
exists a sequence 

= Ho< Hi< H2< Hi< ... 

of integers such that for every finite- dimensional vector space V and any function f : 
V V, there exists an accurate sampling sequence vi, V2, V3, . . . & V for f of degree d 
at scales Hi, H2, .... 

Remark 3.14. The key point here is that the scales Hi, H2, H^, . . . are universal] they 
depend on d, but otherwise and work for all vector spaces V and functions /. 



Proof. We select Hj recursively by the formula -f/j+i := F{Hj), where F = : N ^ N 
is a sufficiently rapidly growing function depending on d that we will choose later. 

We use the probabilistic method, choosing vi,V2,... € V uniformly at random, and 
showing that (if F was sufficiently rapid) the resulting sequence will be an accurate 
sampling sequence with positive probability. 

We begin with observing that in order to verify the condition (13.21) . it suffices by the 
triangle inequality to show that with positive probability, one has 

/,ro,ri,...,r^, — G'/,ro,ri,...,r^,_ J ^ r (3-4) 



V 
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for all 1 ^ (i' ^ c?, all ^ To < . . . < r^/, and every Lipscliitz function G : T>^^'^^'^^^ ''^ 
C, where 

Gf,ro,ru...,r^Xx) := E^^ ^^h.^ g^^r,, Eft^,^^ V 

(d' d 
(fix + Yl ^^^^ ■ + Yl ^J^J + ^ ■ ^o))(^,,...,^,)g{o,i}^feeF«'-o 

By the union bound, it will suffice to show that for all 1 ^ c?' ^ c? and all ^ Tq < 
... < r^/, with probability 1 — ou ^oo-dHr ... Hr ri(l), (13.40 holds for all Lipschitz 
functions G : p{o,i}'*xF-'o ^ 

since the total failure probability can be made to be 
less than 1 by choosing F to be sufficiently rapid. 

We can normalise G to have Lipschitz norm 1. By the Arzela-Ascoli theorem, the 
space of such functions is compact in the uniform topology. In particular, there exists a 
collection of functions G : V^'^'^^''^^ "^^ — > C of Lipschitz norm 1, S, of size Od,HrQ,ri{^), 
such that any other such Lipschitz function lies within (say) of a function G & S 
in the uniform metric. Because of this, we see from the union bound again that it will 
suffice to show that for all 1 ^ ci' ^ c? and all ^ tq < . . . < rd' , and all functions 
G : p{o.i}'xP'''" ^ C of Lipschitz norm 1 in S, 

\Gf^ro,ri,...,r^, ~ G^/,ro,ri,...,rj,_J ^ 2dri 

of holds with probability 1 - OH,.^,^oo;d,Hro,...,Hr^,_^,rA^)- 

Fix d', To, ... , rd', G. By Markov's inequality, it suffices to show that 

^ / \Gf,ro,ri,...,r^, " G f^ro,ri,...,r^, \ = O//,^, ^oo;^,^,^ ( 1) ; 

by linearity of expectation it thus suffices to show that 

uniformly in x G ^. 
Fix X. We observe that 

G f,ro,ri,...,r^,ix) = K^^^Hr^, fvi,...,VHr^,_^ ' '^d') 

and 

G f^ro,r^_,...,r^,_^{x) = ^h&V fvx,...,VHr , W 



V 



'd'-l 



where fvi,...,VHr : V V is the function 

— 1 

(d'-l d 

(fix + ^^^i ' + + Y ^i^^ + ^ ■ ^o))(^i,...,^,)g{o,i}^6eF«'-o 

i=i j=d'+i 
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As the notation suggests, the function fvi,...,VHr depends on the values ofvi,..., vh^ , 
but not on higher elements of the sequence. Also, as G has Lipschitz norm 1, / takes 
values in T>. The claim now follows from Corollary 13.31 □ 



4. Proof of main theorems 

We are now ready to prove the main theorems. We shall just prove Theorem 11.101 using 
Theorem 11.201 the deduction of Theorem 11.91 using Theorem 11.191 is exactly analogous 
(see the brief remarks at the end of this section) . 

Fix F and d, and let k = C{d) be the quantity in Theorem 11.201 By increasing k if 
necessary we may assume k ^ d. Assume for sake of contradiction that Theorem 11.101 
failed for this choice of F, d, k. Then we can find 5 > and a sequence f^"'^ : V^'-"-* — >■ V 
of functions on finite-dimensional vector spaces V^''^^ such that 

||/(")||^.(v^(„))^5 (4.1) 

for all n, but 

II -cWi, 

lit'=(y(")) 



ll/^"^L^(y(")) = On^oo(l). (4.2) 



We now let F(x) := x, and let 



l<Hi<H2< ... 

be the sequence in Proposition 13. 13( it is important to note that this sequence does not 
depend on n. From that proposition, we can find an accurate sampling sequence 



(n) (n) 



. . . G V^^"^ 



for of degree k at these scales. We fix such a sequence for each 



n. 



We will use these sampling sequences to lift the functions /^'^•' on V^'^'' to a universal 
dynamical system for F"^ by the usual Furstenberg correspondence principle method. 
We begin by constructing this universal space. 

Definition 4.1 (Furstenberg universal space). Let X := T>^'^ be the space of functions 
C, -.W^ ^ v. With the product topology, this is a compact metrisable space, with Borel 
(T-algebra B. It has a continuous action h ^ oi the additive group F"^, defined by 
the formula 

ThCix) ■.= C{x + h). 

We let Pr(X)-^ be the space of all Borel probability measures on X which are invariant 
with respect to this action; note that X = {X,B,fj,, (Th)he¥'^) is a F'^-system for any 
/i e Pt{X)^. If //("^ G Pr(X)-^ is a sequence of such measures, and /i G Pr(X)-^ is 
another measure, we say that yU*^"-* converges vaguely to fi if we have 

hm / 0(c) rf/x(")(c) - / m dfiio 

' X J X 



n— >oo 



for all continuous functions 6 : X ^ <C 
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Because X is compact metrisable, and the action of T is continuous it is a well-known 
fact that Pr(X)"^ is sequentially compact; thus every sequence of measures in Pr(X)-^ 
has a vaguely convergent subsequence whose limit is also in Pr(X)"^. 

For each n, we define a measure /x*^"^ G Pr(X)^ on X by the formula 

where 5 denotes the Dirac mass and for each x G V^^\ Cn,x G X is the function 

C„,.(a) := T^..^.)/(")(^) = T ^ .w/^^H^) 

for all a G F'^ (note the sum on the right-hand side has only finitely many non-zero 
terms). Observe that is indeed T- invariant. By passing to a subsequence if nec- 
essary, we may assume that /i*-"-* converges vaguely to a limit /i G Pr(X)^. We write 
X:= {X,B,fi, {Th)hew-). 

Let f : X ^ V he the indicator function f{() := C(0). We observe the key correspon- 
dence 

/ G{T,J, . . . ,T,J) rf/iW(C) = / G(T,^.,^„)/("), . . . ,T,^.„^,n)/(")) (4.3) 

Jx Jy{") 
for all ai, . . . , Ofc G F"^, all n, and all continuous G : T>^ — > C. 

We now record the (standard) fact that the countable collection of shifts Thf for /i G F'^ 
generate L°°(X): 

Lemma 4.2 {T^f generate L°°(X)). Given any cf) G L°°(X) and e > 0, there exists a 
finite number of shifts /ii, . . . , /i^ G F"^ and a continuous function G : T)^ C such that 

j |0-G(T^^/,...,T^^./)| d^^^e. 



Proof. For continuous 0, the claim follows easily from the Stone- Weierstrass theorem 
(and in this case we can upgrade the L} approximation to L°° approximation). As X 
is compact metrisable, the Borel measure is in fact a Radon measure, and so (by 
Urysohn's lemma) the continuous functions are dense in L°°(X) in the i^^(X) topology, 
and the claim follows. □ 



We can now use the machinery of the previous section to deduce various important facts 
about X and /. For instance. Lemma [3.111 now implies 



Lemma 4.3 (Ergodicity). X is ergodic. 



Proof. By the mean ergodic theorem, it suffices to show that 



lim 

r— >oo 



X 



E^^^HrThg - I g dfx 



X 



dfi = 
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for all g G L°°(X). By Lemma W7]\ and a standard limiting argument it suffices to show 
this for g which are functions of finitely many shifts of /, say g = G{T^_^f, . . . ,T^^f). 
We will then show that 



/ Ef^^^HrTfig - gdfi 
J X J X 



By vague convergence it suffices to show that 



Jx J X 



►oo;G,fc,6i, 



for all n. By (14. 3p . we can rewrite the left-hand side as 



V 



V 



where 



^?(") :=G(Tg^..,„)/("),...,Tg^..,„,/(")). 
But the claim now follows from Lemma [3.111 (and Remark 13.81) . 



□ 



In a similar spirit, Lemma [3.91 implies 

Lemma 4.4 (/ has large Gowers-Host-Kra norm). We have ||/||c/d(x) ^ S. 
Proof. From the mean ergodic theorem we have 



i(x) = limsupE^^gpXi / A,; J dfi 

Ki^oo Jx 



and by induction we have 



y,(x) = lim sup . . . lim sup E^^^^k, . . . ^ji^^^k, / A^^ . . . Aj^J rf/i. 
It thus suffices to show that 

whenever 

1 ^ < . . . < ri. 

By reversing the order of averages, it suffices to show that 

%,6F^'-d • • • ^leF^n / A^^... A^J dfl > 5^' - 0,,^oo(l) 

whenever 

1 ^ ri < . . . < r^. 
Fix ri, . . . , Trf. By weak convergence, it suffices to show that 

for all n. By (14. ip . it suffices to show that 
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By fl4.3l) . left-hand side can be rephrased as 

^n) . . . -Jn) f 

J V 

and the claim now follows from Lemma [3.91 (and Remark IS.Sp . □ 

We have now verified all the hypotheses of Theorem 11.191 Applying that theorem, we 
conclude that ||/||ufc(x) > c for some c > (which could be very small, but positive). 
Thus we can find a phase polynomial (p ^ 'Pa;-i(X) of degree k — 1 such that 

f(j) dfi\ > c. 



X 



Let £ > be a small number (depending on d, k, c) to be chosen later. By Lemma [4. 2 [ 
we can find bi, . . . ,bm ^ (with m potentially quite large, but finite) and a continuous 
G : ^ C such that 

/ \<P-G{T^J,...,^J)\^e. (4.4) 

Since takes values in V, we may assume without loss of generality that G does also. 
If e is small enough depending on c, we thus have 

I / fGmj,...,^J) d/il > c/2. 
Jx 

By vague convergence, we thus have 

I / fG{T^,J,...,T^,J) > c/4 

Jx 

for all sufficiently large n (depending on G,m,c). Using (14. 3p . we rearrange this as 



/(")G(rj^.„^,„,/W, . . . ,Tg^..,„,/W)| > c/4. (4.5) 

Now let ri be a large integer depending on the hi, ... , bm, £, and let rj := ri + (j — 1) 
for j = 2, . . . ,d. Since is a phase polynomial of degree k — 1, we have 



/ \As,...AsJ-l\ dfi = 
Jx 

for all di G F-'^'i , . . . ,dk & F^''^ . From many applications of (14. 4p , the triangle inequality, 
and the boundedness of 0, G, we conclude that 

/ I A,, . . . A,MTiJ, . . . , Tg, J) - 1| rf/x «fc e 
Jx 

for all di G F^''i , . . . , G F-'^''^ . By vague convergence, this implies that 

/ I A,, . . . A„^,G(Tg^/, . . . , T^,J) - 1| «, e 
Jx 

for all sufficiently large n (depending on e, H^^, . . . , Hj.^). Using (14.31) . we can rearrange 
the left-hand side as 

/ I ^ai ■!?(") • • • ^afc-^MG'(Tg^.^(„)/^"\ . . . , Tg^.^M/^"-*) - 1| 
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and so on averaging we obtain 

jy(") 

Applying Lemma 13.121 we conclude (if ri is sufficiently large depending on 61, ... , bm, £) 
that 



Now we invoke a local testability lemma: 

Lemma 4.5 (Polynomiality is locally testable). Let V be a finite- dimensional vector 
space, let k ^ 1, let g : V —>■ V be a bounded function, and suppose that 

Eh,_h,ev [ . . . Ah^g - 1\ ^ e (4.6) 
Jv 

for some e > 0. Then there exists a phase polynomial G Vk-iiV) such that 

\g-(t)\^ O£^0;d(l)- 



For F = F2, this result is essentially in [T] or [SHI Proposition 4.6], but for the convenience 
of the reader (and in view of the subtle difference between phase polynomials and 
polynomials, see Remark 11.31) we give a full proof of this lemma in the appendix. 

Applying this lemma, we conclude that there exists (j)^'^'^ E Vk-i{V^'^^) such that 

/ |G(Tg^ . . . ,Tg^..,„,/W) - 0^1 ^ o,^o;fc(l). 
Jv 

Inserting this into fl4.5p we conclude that 



> c/8 

V 

if e is sufficiently small depending on c,k. But this contradicts (14. 2p . The proof of 
Theorem 11.101 is complete. 

The proof of Theorem 11.91 is identical, but with k now set equal to d, and Theorem II. 191 
used instead of Theorem 11.201 We leave the details to the reader. 

Remark 4.6. It is tempting to try to adapt these arguments to the cyclic setting Z/A^Z, 
in which the role of polynomials is replaced by that of a nilsequence (see [10], [11] for 
further discussion), thus establishing the Inverse conjecture for the Gowers norm for 
Z/A^Z that was formulated in those papers. The analogue of Theorem 11.191 is known, 
see [H] . However, two obstructions remain before one can carry out this program. The 
ffist is to compensate for the rigidity of arithmetic progressions that seems to prevent a 
counterpart of Corollary 13.31 from holding in the cyclic group setting (see Remark 13. 4p . 
The second is that whereas polynomiality is locally testable thanks to Lemma [4.51 it is 
unclear whether the property of being a nilsequence is similarly testable. 
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Appendix A. Proof of Lemma 14.51 



In this appendix we give a proof of Lemma [4.51 following the arguments in and [201 
Proposition 4.6]. We begin with a variant of Lemma [1.21 

Lemma A.l (Discreteness). Let k ^ 0, let V be a finite- dimensional vector space, and 
4> G Vkiy). Then there exists 9 G M/Z and an integer K ^ 1 depending only on ¥ such 
that (j){x) is equal to e^'^*^ times a K^^ root of unity for every x ^ V. 

Proof. See Lemma D.5] (which gives the explicit value K = pL^/pJ+i^ where p is the 
characteristic of F) . □ 

We also have a rigidity lemma. 

Lemma A. 2 (Rigidity). Let k ^ 0, let V be a finite- dimensional vector space, and 
(j) G Vkiy). Suppose that |0 — 1| ^ £ for some e > 0. If e is sufficiently small 
depending on k, ¥, then is constant. 

Proof. We induct on k. For k = the claim is obvious, and for /c = 1 is a linear 
character (times a phase) and the claim can be worked out by hand. Now suppose 
A; ^ 2 and the claim has already been shown for smaller values of k. Since is a phase 
polynomial, we have Aq . . . Ao0 = 1, and thus has unit magnitude. Observe that if 
10 — 1| ^ e, then Jy \Th(f) — 1\ ^ e for every h ^ V. Using the elementary estimate 

|A;,0-1| ^ |0-1| + |T^0-1| 

(using the fact that has unit magnitude) we conclude that 

/ |A;,0- 1| ^ 2e 
Jv 

for every h ^ V. On the other hand, Aft0 G Vk-iiV), so by induction hypothesis (if e 
is small enough) we conclude that Ah4> is constant for all h ^V. Thus G Vi{V), but 
then the claim follows from the k — 1 case. □ 

We now prove Lemma 14.51 The case k = 1 is easy, so suppose that k ^ 2 and the 
claim has already been established for k — 1. To abbreviate the notation we shall write 
o(l) for O£^o;fc(l)- We say that a statement P{x) holds for most a; G ^ if it holds for 
(1 — o(l))|\^| elements of v. 

We fix k, V, f. We may assume that e is small depending on d, as the claim is trivial 
otherwise. From (14. 6 p and Markov's inequality we see that 

^h,,...M.,ev [ I A,, . . . A,,_>,/ - 1| = 0(1) (A.l) 
Jv 
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for most h E V. Let us call h good if flA.ip holds. Applying the induction hypothesis, 
we conclude that for any good h there exist^ (ph G Vk-2{V) such that 

/ |A;,/-0;,| ^O(I). 

Jv 

In particular, this implies (by Markov's inequality) that for all good h, we have 

f{x + h)J{x) = M^)+o{l) 
for most V. Since / is bounded in magnitude by 1, this implies that 

|/(x)| = 1-0(1) 
for most X, and for all good h we have 

f{x + h) = Mx)fi^)+o{l) (A.2) 

for most X. 



We now pause to perform a discretisation trick. Write p := char(F). From repeated 
applications of flA.2p we see that 

fix) = fix + ph) = (l)hix)Mx + h)... (Phix + ip- l)h)fix) + o(l) 

for most X, and thus 

4>hix)(j)hix + h) . . . (phix + ip- l)h) = 1 + o(l) 

for at least one x. On the other hand, from Lemma lA.ll 0/j takes values in e^'^*^ times 
j^th j^QQ^g Qf unity for some fixed K depending only on d,p. Thus e^'^*^^ times a K^^ 
root of unity is within o(l) of 1, and so e^'^*^ lies within o(l) of a pK^^ root of unity. 
Rotating (ph by o(l) if necessary we may assume that e^"^*^ is exactly a pK^^ root of 
unity, and in particular we have 

<pf ^ 1 (A.3) 

whenever h is good. 

Now suppose that hi, h2, h^, are good and form an additive quadruple in the sense 
that /ii + /i2 = ^3 + ^4- Then from flA.2p we see that 

fix + h + /12) = fix)(l)h,ix)(l)h2ix + h) + 0(1) (A.4) 

for most X, and similarly 

fix + /i3 + h4) = / (x) 0/13 (x) 0/14 (x + hs) + 0(1) 

for most X. Since |/(x)| = 1 + o(l) for most x, we conclude the approximate cocycle 
relationship 

0hi(a;)0/i2(a; + /ii)0/,3(x)0/,4(x + /is) = 1 + o(l) 
for most X. In particular, the average of the left-hand side in x is 1 — o(l). Applying 
Lemma IA.2I (and assuming e small enough), we conclude that the left-hand side is 
constant in x; using the discretisation flA.Sp . we conclude (again for e small enough) 
that it is in fact 1. Thus 

(phiix)(ph2ix + hi) = 0/j3(x)0/»4(x + hs) (A. 5) 

for all X and any good additive quadruple hi,h2,h3, h^. 

^This quantity plays the same role that cocycles do in ergodic theory. 



20 TERENCE TAG AND TAMAR ZIEGLER 

Now for any k & V, define the quantity 4j{k) G C by the formula 

^{k) := (Pi^MMhi) (A.6) 

whenever hi,h2,hi + h2 are simultaenously good. Note that the existence of such an 
hi, h2 is guaranteed since most h are good, and flA.Sp ensures that the right-hand side 



of flA.6p does not depend on the exact choice of hi, h2 and so ip is well-defined. From 



flA.3l) we see that ip takes values in the pK^^^ roots of unity, and in particular only has 
0(1) possible values. 

Now let x G V and h be good. Then, since most elements of V are good, we can find 
good ri,r2,Si, S2 such that ri + r2 = x and Si + S2 = x + h. From (lA.4p we see that 

f{y + x) = f{y + ri + ra) = f {y)<prAy)<Pr2{y + ^^i) + o{l) 

and 

f{y + x + h) = f{y + si + S2) = f{y)My)My + Si) + 0(1) 
for most y. Also from (lA.2p we have 

f{y + X + h) = f{y + x)(t)h{,y + x)+ 0(1) 

for most y. Combining these (and the fact that |/(y)| = 1 -l- o(l) for most y) we see 
that 



<PsAy)(t>s2{y + si)(j)rAy)'Pr2{y + ri)(ph{y + x) = i + o(i) 

for most y. Taking expectations and applying Lemma IA.2I and (]A.3p as before, we 
conclude that 



<PsAy)(ps2{y + si)(prAy)<Pr2{y + n)0h(z/ + x) = i 

for all y. Speciahsing to y = and applying flA.6p we conclude that 



Mx) = + h)^{x) = Ah^pix) (A.7) 

for all X G and good h; thus we have succesfully "integrated" (ph- We can then extend 
(l)hix) to 3A\ h E V (not just good h) by viewing (]A.7p as a definition. Observe that if 



h E V, then h = hi + h2 for some good hi, h2, and from (lA.Tp we have 

(phix) = 0/11 (a;) 0/12 (x + hi). 

In particular, since the right-hand side lies in Vk-2{V), the left-hand side does also. 
Thus we see that Ahip G Vk-2iy) for all h eV , and thus Q G Vk-iiV). If we then set 
g{x) := f{x)ip{x), then from flA.2l) . flA.7p we see that for every h E H we have 

g{x + h) = g{x) + o(l) 

for most X. From Fubini's theorem, we thus conclude that there exists an x such that 
g{x + h) = g{x) + o(l) for most h, thus g is almost constant. Since \g{x)\ = 1 + o(l) for 
most X, we thus conclude the existence of a phase 9 G M./Z such that g{x) = e^'^*^ + o(l) 
for most X. We conclude that 

f{x) = e'"'V(a;) + 0(1) 
for most X, and Lemma [4.51 then follows. 
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